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Abstract — In distributed storage systems reliability is achieved 
through redundancy stored at different nodes in the network. 
Then a data collector can reconstruct source information even 
though some nodes fail. To maintain reliability, an autonomous 
and efficient protocol should be used to repair the failed node. 
The repair process causes traffic and consequently transmission 
cost in the network. Recent results found the optimal traffic- 
storage tradeoff, and proposed regenerating codes to achieve 
the optimality. We aim at minimizing the transmission cost in 
the repair process. We consider the network topology in the 
repair, and accordingly modify information flow graphs. Then 
we analyze the cut requirement and based on the results, we 
formulate the minimum-cost as a linear programming problem 
for linear costs. We show that the solution of the linear problem 
establishes a fundamental lower bound of the repair-cost. We also 
show that this bound is achievable for minimum storage regener- 
ating, which uses the optimal-cost minimum-storage regenerating 
(OCMSR) code. We propose surviving node cooperation which 
can efficiently reduce the repair cost. Further, the field size for 
the construction of OCMSR codes is discussed. We show the gain 
of optimal-cost repair in tandem, star, grid and fully connected 
networks. 

Index Terms — Distributed storage systems, regenerating codes, 
minimum-cost, surviving node cooperation, linear optimization. 

I. Introduction 

THE increasing number of new applications such as voice 
and video over Internet, video on demands, user gener- 
ated content (like YouTube service) have caused a fast growth 
of traffic over the wired and wireless networks during the last 
decades. This traffic growth requires big data storages. Con- 
sequently, the idea of distributing files among some storage 
nodes on the network has been proposed. High demand in QoS 
(quality of service) in e.g., fast and ubiquitous access, low- 
delay and reliability poses high requirements on distributed 
information storage. Compared to centralized storage systems, 
the benefits of distributed storage systems include fast and 
ubiquitous access, high reliability, availability and scalability. 
For the benefits, recently distributed storage systems have 
been used in big data centers like Google file systems (TJ, 
Oceanstore [2| and Total Recall (3J. 

On another aspect, the distributed storage system has 
brought the higher reliability of the measured data in wireless 
sensor networks 10). Wireless sensor networks consist of 
several small devices (sensors) which measure or detect a 



physical quantity e.g., temperature, pressure, light and so on. 
The main characteristic of the sensors are limited battery, 
low CPU power and limited communication capability Q. 
Because of the limited communication capability of sensors, 
communication between nodes is generally through multi-hop. 
That is, storage nodes may relay the message of other nodes. 

The storage nodes are vulnerable due to disk failure, power 
off, or a node leaving the system. Thus to make the data 
reliable over the unreliable nodes, the data is encoded and 
distributed among storage devices 0, (6), Q. Reliability 
is usually obtained through redundant nodes in a distributed 
storage system, in which error control (EC) codes are normally 
used to increase the storage efficiency. An EC code with the 
MDS (maximum distance separable) property is optimal in 
term of the redundancy-storage tradeoff. If a file of size M bits 
is coded by (n, fc)-MDS codes (k < n) and distributed among 
n nodes, each node stores M/k bits. Then every k nodes can 
reconstruct the original file. Nonetheless, if a node fails, and a 
new node joins the system, the new node needs to download 
M bits to regenerate M/k bits. Thus, it may not be optimal 
considering the traffic for regenerating a new node. In [0, 
(6), the authors investigate the problem of the repairing traffic 
(bandwidth) and find the optimal storage-bandwidth tradeoff. 
A new class of erasure codes, namely regenerating codes based 
on network coding ((8), ||9)X are proposed in 0, |]6] to achieve 
the optimal tradeoff. In the repair process, the new node may 
not have the same encoded data as the failed node. However, 
the restored data has the same MDS property (i. e., with the 
new node still every k nodes can reconstruct the original file). 
We call this property as regenerating code property (RCP). 
This kind of repair is generally called functional repair, in 
contrast to the exact repair, where the new node stores the 
same data as the failed node. The exact regenerating of a new 
node is studied in iflOl. ifTTl. fl2l. 

Two extreme scenarios in the fundamental storage- 
bandwidth tradeoff of regenerating codes [5 1 correspond to 
two types of regenerating codes: one scenario, namely the 
minimum storage regenerating (MSR) code, uses the same 
amount storage space as the MDS code but with lower repair 
traffic. Another scenario, namely the minimum bandwidth 
regenerating (MBR) code has slightly more storage per node, 
but its repair bandwidth is considerably lower than the MSR 
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Fig. 1. A distributed storage system in a 4-node tandem network. Dashed lines denote how network codewords are formed. Node 4 fails and node 5 is the 
new node. For regenerating a new node, pi,P2>P3 are formed by linear combination of fragments in node 1, 2, and 3, respectively. The underlying finite 
field is GF(ll). 



(MDS) codes |5|. To further decrease the repair traffic in 
the scenario of multiple simultaneously-failed nodes, reference 
lfl3l proposes new node cooperative repair. In new node 
cooperative regenerating codes, the new nodes cooperate to 
minimize the repair bandwidth. The difference between new 
node cooperative regenerating and our method is that in 
our approach surviving nodes cooperate, i.e., surviving nodes 
can combine the received data with their own information. 
These two approaches together can provide a fully cooperative 
regenerating process. 

Though references J5], ||6), 0~3], iflOl have well addressed 
the optimal storage-bandwidth tradeoff in distributed systems 
from different aspects, the link cost (transmission cost in 
channels) and the impact of the network topology have not 
been considered. In a practical system, the cost is an im- 
portant design consideration and different links (channels) 
of a network may have different costs. The topology of 
networks may also impact the repair cost. Looking through 
two main applications for distributed storage systems in data 
centers and wireless sensor network, we find that data centers 
have hierarchical network structures [TTTl and wireless sensor 
networks have multi-hop network structures llSTI . Recently, 
reference [14] considers the transmission cost from surviving 
nodes to the new node and the cost-bandwidth tradeoff is 
derived. This model suits for the scenarios in which there are 
direct links from surviving nodes to the new node. Yet, this 
model does not exploit the network structure. For instance, 
consider two storage nodes that have direct links to the new 
node as well as a link between themselves. Suppose these 
surviving nodes want to send data to the new nodes. Instead 
of direct communication, the surviving nodes could cooperate 
and send an aggregated data to the new node. Thus, we 
propose the idea of surviving nodes cooperation in order to 
reduce the transmission cost. This problem is also interesting 
in distributed storage systems in wireless sensor networks 



(WSN), where reducing transmission cost is demanding. In 
1 30 1 it has been shown that the optimal-cost repair can be 
found also in a decentralized approach which suits for WSN 
due to the lack of a central node and CPU power limitation 
of nodes. 

Other related work includes followss. Reference [15] pro- 
poses a tree-structure algorithm for minimum storage regen- 
erating (MSR) codes. The work considers a network having 
links with different bandwidth (capacity). Then by exploiting 
Prim's algorithm, they find a repair approach which minimizes 
the repair time (i.e., finding routes in network with the highest 
capacity to maximize the speed of repair process). Our work is 
distinct from ITT31 since we minimize the repair-cost instead of 
repair-time. Repair-cost can be formulated as the summation 
of costs on all links in network, while repair-time depends on 
the bottleneck of a route. 

We shall study the repair-cost in multi-hop networks where 
links may have different transmission costs. To formulate the 
problem, we modify the information flow graph by taking 
into considering the network topology. Then, by cut-set bound 
analysis and solving a linear programming problem, we derive 
a lower bound of repair costs. In distributed storage systems, 
networks evolve along with time. Thus, there is infinite stages 
of repair. In general, one should run cut-set bound analysis 
in infinite stages to find the optimum cost. We first give a 
lower bound of repair-cost by cut-set analysis in only first 
stage of repair. Later, we prove that this lower bound is 
tight for minimum storage regenerating codes which are the 
optimum-cost minimum storage regenerating codes. To the 
best of our knowledge, we are the first who consider surviving 
node cooperation (SNC) scheme to reduce the repair cost. SNC 
allows intermediate nodes in the network to combine their 
received fragments from other surviving nodes and their own 
stored data. For a general setting, we formulate the minimum- 
cost problem and study the solution. Related work includes 
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[23 1, in which the minimum-cost multicast problem is studied 
in networks with network coding. Reference ||241 studies the 
transmission cost of distributing a source file among nodes 
on the network with or without network coding. It is shown 
in 11241 that by file splitting the optimality can be achieved. 
Here we assume a source file is already distributed among 
different nodes. Our objective is to find the optimum repair 
process when a storage node fails, i.e., how to regenerate a 
new node with the minimal cost. It may worth to note that 
our approach holds for every network topology. The algorithm 
takes the cost of transmitting one unit of information between 
each pair of nodes as its input and find the optimum cost 
repair. In addition, we try to find a closed form for the repair 
cost for some specific topologies like tandem, star, grid, and 
fully connected networks. 

The remainder of the paper is organized as follows. In 
Section [Til we will give a motivation example by a specific 
network. For more general networks, we formulate optimiza- 
tion problem in Section Hill In Section IIV1 we discuss the 
optimal-cost repair when the storage capacity per node is 
a = M jk. In Section IVl we investigate the fundamental lower 
bound of repair costs in tandem, star, grid, and fully connected 
networks and present the gain of the optimal-cost repair in 
those networks. Finally, in Section [VI] we conclude the paper. 





TABLE I 




Notations 


Symbol 


Definition 


n 


number of storage nodes in a distributed storage 




system 


k 


minimum number of nodes needed to- 




reconstruct the original file 


d 


number of surviving nodes for a repair process 


a 


storage amount of individual node 


P 


amount of data downloads from each 




surviving node in the repair process 


M 


original information file size 


s 


M X 1 vector denoting the original file 


£: 


M X a matrix denoting coding 




coefficients of node i 




a X 1 vector denoting the content of node i, 


i.e., Xi = Of 8 




&non — opt 


repair cost by non-optimized-cost approach 




(repair-cost without SNC) 


<Tc 


repair cost by optimized approach 




(repair-cost with SNC and optimization) 


9c 


optimization gain, g c = cr„ on - op t /a c 



II. Motivating Example 

We first give an example to illustrate the motivation. Con- 
sider a distributed storage system in a four-node tandem 
network shown in Fig. Q] Here we follow the notation of 
the regenerating codes Q, (6). The notations are also given 
in Table U We divide a source file into equal size fragments 
(packets). A file of size 4 mega-bits (M — 4 fragments; One 
fragment has 1 mega-bit) is coded with a regenerating code 
[5| and distributed among 4 nodes (n — 4). Each node stores 
2 one-mega-bit fragments (a = 2) and the source file can be 
reconstructed by any 2 nodes (k = 2). When a node fails (say 
node 4), a new node downloads /3 fragments from each of 3 
surviving nodes (d = 3). 



,cut 

node 1 i 




Fig. 2. Cut-set bound analysis in the information flow graph. Node 4 fails 
and node 5 is the new node. There exist direct links from surviving nodes to 
the new node. 



If the new node has direct links to all the surviving nodes, 
the distributed storage system can be represented by a directed 
acyclic graph with a failure/repair on node 4, as shown in Fig. 
[2 The graph is known as an information flow graph 0. In 
the graph, there is a source node (S) connected to storage 
nodes through infinite-capacity links. Each storage node is 
depicted by input (in) and output (out) nodes connecting by 
a a-capacity link. When a node fails, a new node downloads 
f3 fragments from each node. A DC denoting a data collector 
reconstructs the source file from any k storage nodes. 

By cut-set bound analysis on the information flow graph, 
it is shown that there is the fundamental storage-bandwidth 
tradeoff to regenerate a new node |]5] . A cut in the flow graph 
refers to a set of edges in which network nodes are divided 
into two separated parts; one part contains S and another part 
contains the DC. The value of a cut is the sum of capacities 
of edges from the source to the destination direction. Then, 
the corresponding cut must meet a fundamental requirement 
resulting from min-cut max-flow theorem for multicast net- 
works [8|. The theorem expresses that in a multicast network 
destinations can reconstruct the source file whenever all the 
cuts between source and destinations are greater than or equal 
to the source file size. 

By the cut-set bound analysis of the information flow graph 
in Fig. |2 we can see that for a DC to reconstruct the source 
file, it requires a + 2/3 > M. For a = 2, M = 4, then /? > 1. 
That is, the new node must download at least (3 — 1 mega-bits 
from each surviving node. The optimal-traffic repair to achieve 
this lower bound is to download pi,p2,P3 as in Fig. Q]from 
node 1, node 2 and node 3, respectively. Here pi, p^ and p% 
are formed by linear coding at nodes 1, 2, and 3, respectively. 
Yet, the analysis is different if we consider the link cost. 

We assume that each fragment of 1 mega-bits transmitted 
in a single channel costs one transmission unit. To reach the 
new node (node 5), we can easily see that p\ passes the route 
(node 1 H» node 2 M> node 3 n- node 5) with a cost of 
3 units, and p2 passes node 2 M> node 3 M> node 5 with a 
cost of 2 units, and ps passes node 3 i— > node 5 with a cost 
of 1 unit. Thus, the total cost in the repair is 6 units. In this 
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Fig. 3. Regenerating by surviving node cooperation in a tandem network. Here, for regenerating a new node, fragment p± is formed by combining fragments 
of node 1. Fragments P2,ps are formed by linear combination of fragment pi and stored fragments in node 2. Finally, fragments ps,P4 are formed by linear 
combination of P2,P3, received fragments on node 3, with the stored fragments on node 3. 



paper, we call non-optimized-cost repair to the approach which 
minimizes the repair-bandwidth (without optimizing the cost) 
and the corresponding cost is denoted as a non — op t). 

Non-optimized-cost repair may be optimal in term of repair 
bandwidth. Yet it might not be optimal in terms of the repair 
cost. For example, consider the repair scheme in Fig. [3] where 
we allow surviving node cooperation (SNC). Here the coop- 
eration means that one surviving node can combine/encode 
the coded symbols of another node. That is, surviving nodes 
are allowed to linearly combine their own fragments with the 
received fragments from other nodes. For instance, at node 

2, p2 and p% are encoded from the received p\ from node 1 
and stored fragments in node 2. At node 3, p4 and p$ are 
obtained by encoding p2 and p% with the fragments of node 

3. Note that here we only consider functional repair, in which 
the regenerated node may not be identical to the failed node 
but it has the same code property. That is, with the new node 
(node 5), any 2 out of 4 nodes can reconstruct the source. 
It is easy to see that the cost of repair is reduced to 5 units 
as shown in Fig. [3] (only two fragments are transmitted from 
node 3 to node 5). The example shows SNC can reduce the 
transmission cost. We note that the repair approach in Fig. [3] 
still is not optimal in term of the transmission cost. We shall 
show in the next section that the optimal repair cost in this 
example is 4. We further note that SNC can be applied to the 
scenario of one or multiple node failure, but the cooperative 
regenerating codes in iTPTl and ||221 can only be used for the 
scenario of multiple node failure. 

III. Problem Formulation 

In this section, we first introduce the cost matrix which 
presents the cost of transmitting one unit of data (e.g., one 
fragment) between any pair of helper nodes. In our approach 
surviving nodes cooperate in the repair process. Thus, we 
assume storage nodes are capable of performing linear cal- 
culation in finite fields. The number of helper nodes, which 



is denoted as d, is assumed to be greater than k, i.e., d > k 
(for d < k successful regenerating is not possible; see details 
in (5)). Furthermore, we consider the network structure in 
the information flow graph and formulate the optimal-cost 
problem. Our main tool for analysis would be cut-set bound 
analysis in multicast networks, proposed in seminal paper [28 1. 
In the literature, the process that a node fails and a new node 
is regenerated is called a stage of repair. Evolving network 
in a distributed storage system (infinite time leaveing/joining 
of nodes) implies infinite number of repair stages. Cut-set 
bound analysis with considering infinite stages of repair in 
a heterogenous network even for a distributed storage system 
with small number of storage nodes would be complicated. We 
thus try to find a lower bound of repair-cost by cut-set bound 
analysis in the first stages of repair. We later show that this 
bound can be achieved for the minimum storage regenerating 
codes, i.e., a — M/k. In the literature, the process that a node 
fails and a new node is regenerated is called a stage of repair. 



A. Network Setup 

Above, for a specific network, we have studied the repair 
cost and proposed SNC to reduce the cost. Naturally, we may 
ask what is the optimal-cost and how to achieve the optimality 
for more general scenarios. In this section, we first formulate 
a linear optimization problem which establishes a fundamental 
lower bound on the repair cost for general networks. 

Assume in a distributed storage systems with n nodes, node 
n fails and d number of surviving nodes (k < d < n — 1) 
help to regenerate a new node. We also assume that network 
topology and the cost of transmitting one unit of data between 
nodes are given. From the network topology the pathes from 
helper nodes to new node are known. To study the optimal- 
cost repair problem for a given network, assuming failure on 
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node n, we define andxn cost matrix C_, as follows, 

/ C(i2) . . . C(i„) \ 
C(21) ... C(2 n ) 
. . . . 

\ C(dl) C(dn) J 

where an element cuj\ denotes the link cost of transmitting 
one unit fragment from node i to node j. For instance, C(i2) is 
the cost of transmitting one fragment from node 1 to node 2, 
and C(j n ) is the corresponding cost from surviving node i to 
the new node (assuming node n fails). cuj\ is nonzero if there 
is direct link from node i to node j. Otherwise, cuj) — oo. We 
assume that an algorithm extracts the matrix C_ from a given 
network topology and link costs. Clearly, by matrix C_, we can 
calculate the costs of a path (maybe multi-hop) between any 
pair of nodes in the network. Here, we assume there is at least 
one path between any pair of nodes. Furthermore, we assume 
that the network is delay-free and acyclic. In this paper, we 
only consider linear costs. That means if the transmission cost 
of one fragment from node i to j is c^, then it costs mcuj) 
to transmit m fragments from node i to j. 

To investigate the repair-cost in a given network, we modify 
the information flow graph in J5] by introducing the network 
topology and link costs into the graph. 

B. Modified Information Flow Graph 

Consider a storage system with the original file of size 
M distributed among n nodes in which each node stores a 
fragments and any k of n nodes can rebuild the original file. 
We denote the source file with an M x 1 vector, s. Then, 
the code on each node i can be represented by a matrix 
Q.- = (<7-j ' ' ' il") °f s i ze M x a where each column (qj) 
represents the code coefficients of fragment j on node i, and 
then the stored data in node i is 2Li — Q T §.- 

We can denote the flow of information in a distributed 
storage system by a directed acyclic graph G(n, k, a) = 
G(N, A), where N is the set of nodes and A is the set 
of directed links. Similar to (5), graph G(n, k, a) consists 
of three different nodes: a source node, storage nodes and 
several data collectors (DC). The source node distributes the 
original file among storage nodes along with the (assumably) 
infinite-capacity links. Every storage node can be denoted by 
input (in) and output (out) nodes connecting by a link of 
capacity equals to the amount of node's storage (a). Finally, 
the DC reconstructs the original file by connecting to at least 
k storage nodes via the infinite-capacity links and then solving 
linear equations. Contrary to 0, there might not exist direct 
links from surviving nodes to the new node and storage nodes 
may relay other nodes' fragments to the new node. When a 
node fails, d surviving nodes participate in the repair process 
(k < d < n — 1). Here, an optimization algorithm shall 
determine the optimum traffic on the links of the network. We 
note that in distributed storage networks, node failure and new 
node generation may happen infinite times. In a heterogenous 
network, considering all stages of repair makes the problem 
complicated. Hence, in our study we focus on the first stage of 
repair, which is a performance lower bound for infinite repair 
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Fig. 4. Modified information flow graph for the first stage of repair in 
a tandem network. The cut mentioned on the figure with heavy dotted-line 
corresponds to the inequality 2(35) + a > M. 

stages. As an example, the modified information flow graph 
for the first stage of repair on node 4 has been shown in Fig. |4] 
In this example, nodes are connected in a tandem network. 

We use a column vector to denote the number of fragments 
transmitted on the directed links. This vector is termed as a 
subgraph (z — [ z (ij)]\ (ij)eA ) [23], where each element z^ is 
a non-negative integer and represents the number of fragments 
from node i to node j for (ij) £ A. For the subgraph z_ — 
[z(ij)]\ (ij)€A , and defining c^) as the cost of transmitting one 
unit of data (e.g., one fragment) from node i to node j, the 
repair cost can be formulated as, 

0-cfe) = ^2 c (ij) z (ij)- W 
(ij)eA 

Our objective is to minimize the cost (a c ) considering only 
the first stages of repair. 

Remark 1: We note that in the repair problem there would 
be infinite number of repair stages. However, cut analysis in 
the first stage of repair gives the necessary conditions for 
finding the optimum-cost regenerating approach. The analysis 
gives a lower bound of repair-cost for the repair. Later, we 
show that this lower bound can be achieved for a — M/k. 
Thus, in our analysis there would be limited number of cut-set 
constraints. Clearly, the number of cut constraints depends on 
the number of nodes n and k, 

C. Cut-set analysis in the first stage of repair 

1 ) Constraint Region: In the repair process, it is required 
that any k nodes can reconstruct the original file. This property 
is known as the regenerating code property (RCP). The RCP 
must be preserved for an arbitrary number of stages of repair. 
For this, it is necessary for the first stage of repair all the 
cut constraints to be satisfied. Thus, we find the minimum 
cr c under the constraint that in the first stage of repair, all 
cuts of connecting the DC to the new node and k — 1 
other nodes must be greater than or equal to M. In Fig. |U 
the heavy dotted line presents a cut when the DC connects 
to the new node and node 1 for a regenerating code with 
parameter k = 2. The cut constraint relating to this cut can 
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be formulated by the inequality: Z( 35 ) + a > M. By assuming 
vector z = [z(i2)^(23)-2 ; (35)] T j we can express the inequality in 
vector space as, (0, 0, l)z > M — a. Assume r cut constraints 
on the first stage of repair. Denoting \A\ the cardinality of 
existing edges between nodes, we form all the inequalities in 
a matrix form, by defining an r x | A\ dimension matrix L (this 
matrix is called coefficient matrix in the literature [27 1). The 
corresponding inequalities induced by the cut constraints show 
a region in a multi-dimensional space that the subgraph must 
satisfy to be a feasible solution. This region is often called 
poly tope l27l . Consequently the polytope is 



* = u 



(2) 



[z(ij)\ | Zft) >0,Lz>6}, 

where the comparison of two vectors e.g, a>b means every 
element in a is greater than or equals to the element in 6 at 
the same position. 

Example 1: In Fig.|H if the DC connects to node 1 and the 
new node to rebuild the source (k = 2), the first cut constraint 
is 

2(35) > M - a. (3) 

The second constraint follows if we connect the DC with node 
2 and the new node, 



2(12) + 2(35) > M 



(4) 



Similarly, when the DC connects to node 3 and the new node, 
we have the third constraint 



2(23) 



> M - a. 



(5) 



Thus, we can form all these inequalities in a matrix form as 
follows, 



(6) 











2(12) 




' M - 


a 


1 





I) 


2(23) 


> 


M - 


a 





1 




. Z 05) . 




M - 


a 




L 




Z_ 




b 





Remark 2: We note that in the cut-set bound analysis there 
would be some cut constraints that do not affect the polytope 
region. These are called non-active constraints. In the example 
above we only consider the active constraints. 

Remark 3: The polytope ^ is restricted by linear inequali- 
ties. Hence, if Z(jj)S are real numbers then the constraint region 
^ is convex. We can reasonably assume that Zuj\S are real 
numbers. Note that the file is measured by bits (integer) but 
it is normally quite large. Hence we can consider Z(ij)S as 
real values since one fragment has lots of bits. Following this 
assumption, constitutes a convex region. 

2) Linear Optimization: Since the constraint region and ob- 
jective function in the repair problem are linear, the problem is 
a linear optimization problem which can be solved efficiently. 
Finally, we can formulate the optimization problem as, 

minimize a c (z) = T,(ij)eA c {ij) z (ij) 
subject to 



L z > 6, 



(7) 



>0. 



The linear programming in ([7} results in a lower bound of 
repair costs. 



Proposition 1: The repair-cost calculated by problem Q is 
the lower bound of the repair-cost. 

Proof: In a general network, cut-set bound analysis con- 
sidering infinite stages of repair is needed to have all the con- 
straint on 2(y )S. This is the necessary and sufficient conditions 
on zuj) for the repair. Thus, cut-set bounds considering only 
the first stage of repair gives only necessary conditions, and 
in general they might not be sufficient conditions. Denote the 
polytope resulting from cut-set analysis on the first stage as 
Sfr 1 , and the polytope given the cut-set constraints till stage t 
as for f as a positive integer number. We have VP 4 C vff 1 , 
because for any z 6 satisfying constraint till stage t, z 
have to satisfy the cut constraints at first stage, i.e., z € \f rl . 
Denote cr* pt as the minimum value of a c satisfying all the cut 
constraints till stage t. In other words, al pt corresponds to the 
minimum a c on which plane a c = 2~2uj)^A c (ij) z (ij) intersects 
polytope vp*. Since cr^s are non-negative real numbers, 
\J>* c vp 1 , the minimum-cost given the cut constraints till 
stage 1 is not greater than the minimum-cost given the cut 
constraints till stage t. That is <r* pt ^ Copt- ' 

3) Example for the 4 node tandem network: Now we have 
enough tools to find the optimum repair cost in the motivating 
example (Fig. Q3. We assume three nodes joining the repair 
process (d = 3), M = 4, k = 2, a = 2, and the corresponding 
cost matrix is, 



1 oo oo 1 
G= | oo 1 oo 
,oo oo 1 



(8) 



Clearly there exists a direct link between nodes 1 and 2 with 
one unit cost, and similarly between nodes 2 and 3, and nodes 
3 and 5 (new node). Now we analyze the constraint region for 
M — 4, a = 2 in ©. Hence, we can formulate the problem 

as 



minimize 



subject to 




2(12) + 2(23) + 2(35) 



2(12) 




" 2 " 


2(23) 


> 


2 


. Z (35) . 




2 



(9) 



Solving the linear optimization problem (e.g., by a simplex 
method ll27l ) gives the optimal subgraph (z(i2), 2(23), 2(35)) = 
(0, 2, 2) with a cost of 4 units. A linear network coding by 
selecting coefficients (in this example) from GF(5) with SNC 
in Fig. [3] can meet the minimum-cost subgraph. The coding 



scheme is p = 0, p 2 = 2a 2 
P 9 + {Qli +h+a 2 + b 2 ) = a x 



£3 = 



2b 



3. Ea 



3a 2 + 2b 2 and p 



P 3 + (0.1 + 2&i + a 2 + 2b 2 ) = a x + 2b x + 2a 2 + 46. 
are fragments for the new node, which satisfies RCP. 



2- Here £4>P 5 



IV. Optimal-Cost Code for the Minimum Storage 
Network (a = M/k) 

In this section, we show that the lower bound of the repair- 
cost is achievable for a = That is, there exists a linear 

k ' 

network code corresponding to the repair with the minimum- 
cost subgraph from the optimization problem (|7). We call 
the codes that achieve this optimal point as the optimal-cost 
minimum-storage regenerating (OCMSR) code. Our proof is 
based on random linear codes and then we discuss the required 
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finite field size for constructing the OCMSR code. Similar to 
the method in [6], we consider the first stage of repair and then 
by induction on the number of repair stages, we generalize the 
results to multiple stages of repairs. To find the sufficient field 
size for successful regeneration, we apply sparse-zero lemma 
as follows. 

Lemma 1: Consider a multi-variable polynomial 
g(ai, 012, ocn) which is not identically zero, and has 
the maximum degree in each variable at most do- Then, there 
exist variables 71,72, ■•■,7n in the finite field GF(q), and 
q > d , such that 5(71,72, -,7n) ^ 

Proof: See proof of Lemma 19.17 in ||28l . ■ 

Suppose a source information file consists of M = ka 
fragments. Assuming each fragment , (i — 1, • • • , M) being 
a vector of elements in ¥ q we can denote the source by a 
vector s = [mi,-- - ,ELm] T - Then vector s is encoded by 
an erasure code (satisfying the RCP) to na fragments and 
distributed among n nodes such that every storage node stores 
a fragments. If denotes the stored symbols of node i, 
then X_i = Q 1 is where Q is an M x a-dimensional matrix 
representing the coding coefficients of node i. When a node 
fails (without loss of generality, we assume node 1 using 
the encoding coefficients Q fails) the optimization algorithm 
finds the minimum-cost subgraph. Following the minimum- 
cost subgraph, the new node is regenerated by surviving node 
cooperation. Clearly, with the minimum-cost subgraph, we 
also know which nodes should encode on the directed graph. 
Then, with a proper finite field, we can find the network code 
and regenerate the new node with the coding coefficients Q , 

and Xi = Q^s. 

To maintain the RCP after the regeneration of the lost node, 
the coding coefficients (Q^) have to meet certain requirement. 
That is, for any selection of k — 1 out of n — 1 surviving 
nodes, Sfc_i = {Q ^ ,Q s }, together with the codes of 

the new node Q , the polynomial det([Q , Q , • • • ,Q ]) 
is a non-zero polynomial. In what follows, we first show 
that det([Q , Q , - • • ,Q ]) satisfying the subgraph of the 
optimization process is not identically zero and then discuss 
the required field size. 

Lemma 2: For regenerating node 1, there exist linear codes 
satisfying the minimum-cost subgraph (resulted from problem 
O) such that the polynomial det([Q , Q , ••• , Q ]) is 
non-zero for any selected set 'B.k-i- That is, 

TT det([Q' Q ,.-.,Q ])^0. (10) 

- 1 - S 1 S k — 1 

{*!,••• ,a*_i}c{2,- ,«} 

Proof: Consider Sfc = {Q , • • • , Q } a set of coding 
coefficients selected from k out of n nodes. Since every 
k nodes can reconstruct the original file, then the matrix 
[Q > ' " " > Q ] has full rank M = ka. Thus, for Hfe_i, the 
matrix [Q , • • • , Q ] has rank (k — l)a. To regenerate 
a new node with the RCP, the minimum-cost subgraph of 
the information flow graph meets the requirement that by 
connecting the DC to the new node and any k — 1 other 
nodes, all the cuts are greater than (or equivalent to) M. This 
requires the matrix of coding coefficients of the cut in the 
modified flow graph containing the new node and selection 



node set having full rank M. To prove this, consider a set V 
containing the data collector, in and out nodes of the new 
node, and out nodes of the nodes in set S^-i. Other nodes 
including the source node are in the complement set V. The 
cut passes the in-out edges of nodes in Sfc_i has the rank 
(k — i)a. Since all the cuts has at least M edges, there would 
be at least R = M — (k — l)a — a edges from V entering 
V. In V, there exist a vectors e.g., vectors in Q which are 
independent of vectors in Hfc_j. Thus, if we send fragments 
corresponding to those a independent vectors through R = a 
edges to the V, the matrix of the coding coefficients of the cut 
will be full rank. Therefore, det([Q , Q s , ■■■ ,Q ]) can 
be non-zero. ■ 

To find the required field size of OCMSR codes, we need to 
know the maximum degree of the variables of the polynomial 
in ( [Tol l. When SNC is allowed in intermediate nodes, as in 
our repairing scheme, the polynomial degree can be greater 
than non-SNC schemes since SNC may involve extra network 
coding processes, compared to the repairing schemes only 
encoding in the storage nodes and the new node. For analysis, 
we use n nc to denote the maximum number of network coding 
processes involved by one fragment in one repair stage. We 
note that n nc < \N\, the number of nodes in the networks. 

Example 1: In the repair process of node four in tandem 
network (Fig. n nc — 4 . The network coding is at node 1, 
2, 3 and at the new node. 

Theorem 1: For a distributed storage system G(n,k,a) 
with the source file of size M, if the field size is greater 
than do, there exists a linear network code such that the RCP 
is satisfied for any stage of repair, regardless of how many 
failures/repairs happened before, where 

d =mAfn nc . (11) 

Proof: The proof is by induction on the number of repair 
stages. That is, we assume before a node fails all the storage 
nodes have the RCP. In each stage of repair, when a node 
fails, the new node is regenerated preserving the RCP. Thus, 
we initialize the code on n nodes by which any k out of n 
nodes can reconstruct the original file. Then if a node fails, 
the new node is regenerated such that the repairing cost is 
minimized and the RCP is preserved. By the RCP, the coding 
coefficients of any k nodes must have full rank M. That is, 

JJ det([Q si ,--- ,QJ)^0. (12) 

Clearly, the maximum degree of variables in (Q~2j is (zj M. 
Thus, by Lemma 1, if the field size (q) is greater than \£)M, 
then there is a network coding solution for repair. Since n nc > 
2 (at least two coding process: one in surviving nodes, and 
another in the new node), do > \T)M; thus, there is a coding 
solution for q > do- 

When a node fails (assume Qi), the optimization algorithm 
finds the minimum-cost subgraph. Accordingly, the fragments 
are combined using linear network coding, and then the new 
node is regenerated. The set including the new node (Q^ and 
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surviving nodes must satisfy the RCP. Thus, 

IJ det([Q;,Q si! ... ,0^1)^0. (13) 

{si,- ,s fe _ 1 }C{2,- ,n] 

By Lemma 2, the polynomial can be nonzero. The maximum 
degree of each variable is less than (^~^[)Mn„ c . By Lemma 
1, if the finite field size q > Mn nc , there is a network 

solution for the repair. Clearly, do = (^)Mn nc > (j~J)M« nc 
for n > k. Hence, for q > da, there exist a code for the repair. 
This concludes our proof. ■ 
In summary, optimal-cost repair for the minimum storage 
regenerating (OCMSR) code is given in two steps. First, the 
optimal-cost subgraph is found decoupled from coding by 
solving problem @. Then, we can construct the code of the 
new node by random linear coding ll25l or deterministic [26] 
from the finite field size determined by Theorem [TJ 

V. Repair-Cost Lower Bound, and the gain of 

OPTIMAL-COST REPAIR IN NETWORKS WITH GIVEN 
TOPOLOGIES 

In this section, we study the lower bound of the repair- 
cost for distributed storage systems on networks with given 
topologies. We aim at showing how considering networks and 
cost together result in a lower cost in the repair. We first 
apply our method to tandem, star and grid networks, where 
there might not exist direct links between new nodes to all 
the surviving nodes. Next, for the scenario that the new node 
has direct links to surviving nodes (as 0, lEDX we show 
that surviving node cooperation can reduce the cost. For the 
purpose, we study the repair cost in a fully connected network. 
It may worth to note that our approach can be applied on 
any network, and for any cost on the links of the networks. 
The networks considered in this section are just examples 
to present the gain of optimal-cost repair. For simplicity, we 
assume in this section that links have unit cost of transmission, 
unless otherwise stated. We define the gain in our approach 
as the ratio of repair-cost in minimum-bandwidth approach 
(denoted as a non - op t) to the optimal-cost repair (denoted as 



' opt 



), 



9c 



) non—opt 



(14) 



A. Tandem Networks 

1) Repair-Cost Lower Bound in Tandem Networks: In a 
tandem topology, nodes are in a line. That is, each node is 
linked to two neighboring nodes, except the nodes in the line 
ends, which have only one neighbor. When a node fails and 
a new node joins, the repair traffic is relayed by intermediate 
nodes to the new node. Then we can formulate the repair-cost 
lower bound by the following proposition. 

Proposition 2: Consider a tandem distributed storage net- 
work consisting of n nodes where each node stores a frag- 
ments and every k nodes can reconstruct the original file 
of size M. Assuming the cost of a link (between adjacent 
nodes) equals to one unit, the lowest repair-cost is achieved 
by cooperation of the k nearest surviving nodes and equals to 



where [x} + = max{a;,0} 

Proof: See appendix A. 

By Proposition [2] we can calculate the gain of optimized 
cost to the non-optimized cost approach. Gain in the repair for 
one of the end nodes in line can be calculated as follows. 

Corollary 1: Consider a distributed storage system with 
parameters (M = k(n — k),a — (n — k), d = n — 1) in a 

tandem network with n nodes in order (node 1-node 2 

node n). If node 1 fails, the surviving node cooperation gives 
the gain 

n(n + l) 



2k(n-k)' 



(16) 



Proof: With the given parameters the minimum band- 
width repair is (3 — 1 |5|. Sending the repair bandwidth from 
node n to the new node costs n units, from node n — 1 to the 
new node costs n — 1 units and so on. Thus in total the repair 
cost in this approach is cr„ „_ opt = n + (n — 1) + • • • + 1 = 
n(n+i) ^ p rom Proposition |2] a c = k(n — k). Hence the gain 
would be g c = 

Therefore, for (n, k)— MDS codes with k = n/2 the g c 
tends to 2 for a large network (n — > oo). 

2) Explicit Construction of OCMSR code for Tandem Net- 
works: Previously, we proved that the optimal cost functional 
repair can be achieved by applying linear network coding in 
a large finite field. In this section, we show that the lower 
bound can be achieved even with an extra condition of exact 
repair. We give an explicit construction for the optimal-cost 
exact repair in tandem networks. The motivation is that it 
requires smaller finite field size comparing to random code 
and also it has explicit construction. We split the source file 
of size M into k fragments. We denote the source file by vector 
m = [toi?tj2 ■ • • mfe] T . To construct (n, k) — MDS code, We 
use a k x n Vandermonde matrix G as a generator matrix. 

/ 1 1 ••• 1 \ 



G = 



a i 



«2 



O-r, 
Oil 



(17) 



V a*" 1 ••• at 1 J 

where ct;S for i 6 {1, • • • , n} are distinct elements from the 
finite field GF(q). 

By the property of Vandermonde matrix, every k x k 
submatrix of G is full rank if a;S, for i 6 {1, ••• , n}, are 
distinct elements. This requires q > n. Each column i.e, 
column t.t G 1, • ■ • ,n in matrix G represents the code on 
node t. We denote the coded data on node t as v(t), then, 



v t = mi + m 2 a t - 



-m k c4 1 



[1 a t 



■a^rn- (18) 



> [k(M -{k- l)a)Y 



(15) 



By the property of Vandermonde matrix, every k x k 
submatrix of G is full rank then a data collector can reconstruct 
the source file by connecting to any k nodes. The regenerating 
process is only by linear combination. Assume nodes are 
labeled in order i.e., node 1 connects to node 2, node 2 
connects to node 1 and 3, and so on. By Proposition for 
M = k fragments, the optimum-cost repair is by transmitting 
M/k — k/k — 1 fragment to the neighbor. Assume node t, 
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for t G {1, • • ■ , n}, fails and a sequence of backward nodes 
{nodet-tj , nodet-fcj+i, • • • , nodet_i} and a sequence of 
forward nodes {nodet+i, nodet+2, • • • , nodet+fc 2 }, such 
that fci + k 2 = k, help the new node to regenerate the content 
of the failed node. 

The repair process is as follows. The new node receives 
fragments from two directions, from node t — 1, aggregating 
data from nodes nodet-ki till node t — 1, and from node 
f + 1, aggregating data from nodes nodet+fc 2 to node t + 1. 
Thus, in one direction node nodet_tj multiplies its content 
by a coefficient £t-fci from GF(q) and sends the result to node 
node t _fc 1+ i. Then node node f _fc 1+ i, multiplies its content 
by £,t-ki+i and combines the result to the received fragment 
and then sends its combined fragment to its next neighbor 
nodet_fc 1 +2- Finally node t — 1 transmits the combined 
fragment Wt-%, which is, 

w t -i = Zt-kiVt-k! +6-fei+iVt-jfe 1 +i H l-£t-i«t-i 

= 6-fci ( m i + mxxt-kt H 1 

+6-fci+i(™i + m 2 a t -k 1 +i + 
H h £, t -i{mi + m 2 a t -i + ■ 



node 1 



+ m k a h ;_l i+1 ) 



k—1 

m k a t 



(19) 

In another route, node node t + 1 sends the aggregated 
fragment to the new node. That is, node<+fc 2 multiplies its 
content by a coefficient £,t+k 2 from GF(q) and sends the result 
to nodet+fc 2 _i. Then node nodet+fc 2 _i, multiplies its content 
by Ct+fej-i and combines the result to the received fragment 
and then sends its combined fragment to its next neighbor 
nodet+fc 2 _2- Other helping nodes do the same. Finally node 
t + 1 transmits the combined fragment w t+1 , which is, 

Wt+l = £t+k 2 Vt+k 2 + £t+k 2 -lVt+k 2 -2 + ' ' ' + £f+l w t+l 

= 6+fe 2 (mi + m 2 a t+ k 2 H h mfeofcM 



+ m 2 a t+ k 2 + ■ ■ 
-6+b-i( m i + m 2 a t+k2 -i 
1- £ t+ i(mi + m 2 a t +i H h m k a^ 



lV t +k 2 -2 + ■ 

■ + m k oc h t +l 2 
+ ■■■ + m k a k t -l 2 

k-l\ 
1 )■ 



-l) 



In order to achieve exact repair, we put w t -\ + w t +\ 

-l 



v t = mi + m 2 a t 



m k af 



Thus, vector £ = [^_ fcl 
be selected such that, 

[6-fci6-fci+i • • • ,£,t+k 2 ]- 



! £t-ll 6+1) 



(20) 

v t 

(21) 
£,t+k 2 ] T should 



node 2 




/ 1 Q-t-hi a 
1 a t - kl +i a 2 t _ ki+1 

V 1 a t+k 2 a? t+k2 



a 



fe-i 

fc-1 



/ 1 \ 



.. 2 



v or 1 / 

(22) 

Since matrix A is non-singular, we can always find linear 
codes (£) that make the exact repair possible. Hence, for the 
successful reconstruction and repair process the finite field 
q > n suffices. 



Fig. 5. Optimal-cost repair in a star network with 6 nodes. 

Fig. [5] shows an example of a distributed storage system in a 
star network. In the scenario, if the central node fails, and all 
link costs are equivalent, the optimal-cost repair reduces to 
the optimal-bandwidth repair in [5|. However, if non-central 
nodes fail, the situation is different. Assuming n storage nodes 
and a source file size M, the lower bound of repair-cost for a 
non-central node is calculated as follows. 

Proposition 3: In a distributed storage system with a star 
topology, the repair-cost of a non-central is greater than or 
equals to 

77 — 2 

a c >( - + l)[M-(k-l)a} + . (23) 

n — k 

Proof: See appendix B. 

By Proposition [3] we can calculate the gain of surviving 
node cooperation comparing to the non-optimized cost ap- 
proach for non-central nodes, as follows. 

Corollary 2: Consider a distributed storage system with 
parameters (M = k(n — k),a = (n — k),d = n — 1) in a 
star network with n nodes. If a non-central node fails, the 
surviving node cooperation gives the gain g c , 

9 ^2^]—2- (24) 

Proof: With the given parameters the minimum band- 
width repair is (3 = 1 [5|. Sending the repair traffic from 
in — 2) non-central nodes to the central node costs (n — 2). 
The central node sends these n — 2 fragments plus its own 
repair. This means the central node sends (n — 1) fragments 
to the new node. In total the repair cost equals to a n on-opt — 
(n — 2) + (n — 1) = 2n — 3. For the optimal cost, we 
calculate from Proposition [3] er c = (^5f + l)lf' Substituting 
M = k(n — k) yields a c — 2n — k — 2. 



Therefore, as an example for k — n/2, g c 



tends to | for 



B. Star Networks 

In a star network, there is a central node, and any pair of 
non-central nodes must communicate through the central node. 



C. Grid Networks 

Consider a 2 x 3 grid network in Fig. [6] The optimal-cost 
repair process depends on the location of the new node. The 
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optimum cost repair for the repair in node 6 can be found by 
our approach. 

Claim 1: In the repair of node 6 in Fig. [6] the optimal- 
cost repair is 7 units corresponding to the minimum-cost sub- 
graph subgraph (z (12) , z (14) , z (23) , z (25) , z (36) , z (45) , z (56) ) = 
(0,1,0,1,1,2,2). 

Proof: See appendix C. 

In this example in the non-optimized cost approach, the 
minimum bandwidth for M = 8, n = 6, k = 4, d = n — 1 is 
(3 = 1 fragment [5|. Surviving nodes transmit their fragment 
to the new node following this approach: node 3 by the path 
nodel — > node2 — > node3 — > 6, node 2 by node2 — > 
node3 — > 6, node 3 by node3 — > 6, node 4 by node4 — > 
node 5 — > 6, and finally node 5 by transmitting a fragment on 
link node5 — > 6. This gives a n0 n-o P t = 3 + 2 + 1 + 2 + 1 = 9 
units. Thus, our approach gives a gain g c = j = 1.2. 

Finding the lower bound of the repair cost in a closed form 
for a grid network is more complicated than those for tandem 
and star networks. Yet, we can know the repair-cost in a grid 
network will not be greater than the repair-cost in the tandem 
topology if other conditions are the same, e.g., link costs, the 
number of nodes. More formally, we have 

Corollary 3: Optimal-cost repair for a distributed storage 
system in a grid network with n node coded by (n, fc)— MDS 
codes (each node stores M/k fragments) leads less repair-cost 
than that of a tandem topology if all other conditions are the 
same. That is, 

<?c 9 " d < a c tandem = [k{M — (k — l)a)]+. (25) 

Proof: The repair process in a grid network can be 
reduced to that of tandem networks if we neglect some 
available links which may reduce the cost. Thus, the cost of 
optimal-cost repair of a grid network is upper-bounded by that 
of a tandem topology. ■ 

D. Fully connected network 

In a fully connected network all nodes have direct links 
to each other. This model suits for data centers where each 
data center is connected to another one through a hierarchical 
network structure H171 . For an example, consider a network 
with 5 nodes where every pair of nodes are connected by direct 
links. We shall show that there is a gain in SNC also in fully 
connected networks. For this, first assume the transmission of 



one unit of data on all links costs one unit. In this case, the 
optimal cost approach is to transmit from surviving nodes to 
the new node (without surviving node cooperation) and the 
cost is equivalent to the approach in 0. 

Proposition 4: For a distributed storage in FigJT] with pa- 
rameters (M = 6,n = 5,fc = 3,d = 4, a = M/k) in a 
fully connected network with equal transmission cost on links, 
the optimum-cost approach is one-hop transmission, namely, 
directly from surviving nodes to the new node. 
Proof: See appendix D. 

Proposition |4] holds for a fully connected network with all 
the links having equivalent costs. In the case that there are 
different costs, then it might not be optimum-cost to follow the 
same approach (direct transmission). In this case it might be 
better to exploit the multi-hop network structure. For example, 
consider a scenario where links from surviving nodes to the 
new node have high cost, however links between surviving 
nodes are low in cost (e.g., some nodes which are located 
close together in one region want to send data to the new node 
located very far from them). In this scenario surviving nodes 
can cooperate and only send the aggregated data of size a to 
the new node. More formally, we have the following result. 

Proposition 5: Consider a distributed storage in Fig J7] hav- 
ing same parameters as Proposition |4] The only difference is 
that transmitting one fragment from surviving nodes to the 
new node costs 3 units, i.e., C( 15 ) = C( 25 j = C( 35 -) = C( 45 ) = 3. 
Then the optimal repair cost equals to 10 and the optimal-cost 
repair is through links Z( 23 ) = Z( 34 ) = Z( 45 ) = 2. 
Proof: See appendix E. 

Thus, cr opt = 2x1 + 2x1 + 2x3 = 10. Yet, the repair by 
direct transmission costs 12 units, which is suboptimal. Hence, 
the optimal-cost approach, using surviving node cooperation, 
gains j| in reducing the repair-cost. 

VI. Conclusions 

We have studied repair costs for distributed storage systems 
with network coding in heterogenous networks. We formulated 
a linear programming problem which gives the fundamental 
lower bound of the repair cost. The linear programming also 
leads us to the optimal cost approach for the minimum storage 
per node, (OCMSR) codes. We prove the existence of the 
code by the random linear code in a large finite field. We 
discussed the required field size for the existence of the code. 
To reduce the cost in networks, we proposed surviving node 
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cooperation. We also proposed an explicit construction for the 
exact optimal-cost repair in a tandem network. For specific 
networks with tandem, grid and star topology, we give the 
closed-form gains of optimal repair. We also discuss the gain 
of our approach for networks with fully connected toplogy. 

VII. APPENDICES 
A. Proof of Proposition 2 

Intuitively, every k nodes can reconstruct the original file. 
Thus, if k nodes are allowed to combine their fragments 
(possibly using SNC) then they can regenerate the new node. 
When the cost of transmission on each channel is the same, the 
k closest neighbors to the new node use the least transmission 
costs on the network. Thus, they may be the optimal-cost 
solution. A stricter proof is as follows. 

Assume n nodes are labeled from 1 to n as shown in Fig. 
[U Without loss of generality, we assume node 1 fails and the 
new node (1 ) is regenerated. We denote the constraint region 
in a general form and then find the minimum repair cost value. 
We obtain the constraint region by the following steps: 

Step 1: connect the DC to the new node and nodes 
n, • • • , n — k + 1, as shown in Fig. [8] Then the cut constraint 
is, 

2(2i') + - 1)" > M => z (2i') >M-(k- l)a. (26) 

Step 2: connect the DC to the new node, node 2 and nodes 
n, ■ ■ ■ , n — k + 2. Then the cut constraint is, 

^(32) + (k - l)a > M => 2 (32) > M - (fe - l)a. (27) 

Step 3: connect the DC to the new node, node 2, • • • — l 
and nodes n, ■ ■ ■ , n — k + j — 1 for j = 3, • • • , k + 1. Then 
the cut constraint is, 

Ztfj-i) +(k- 1)q > M => > M - (fe - l)a, (28) 

Other constraints of connecting DC to the new node and k — 
1 surviving nodes are non-active constraints Q in the problem. 
Finally, the active constraints are 



2 ( 2i') > M-(k-l)a, 
Z( 3 2) >M-{k- l)a, 

Z(k+ik) > M -(k- l)a, 

Z(k+2k+l) > 0, 



Cut 



node n 



(29) 



Z(n-ln-2) ^ 0) 
> 0. 

Therefore, summing both sides of inequalities yields, 

CT C = z (2i') + z (32) H H«(nn-1) > k(M - (k - l)a). (30) 

'A non-active constraint is a constraint that does not affect or change the 
constraint region. 




Fig. 8. Cut in tandem network. This figure corresponds to the cut of z 21 i 
(fc-l)a>M. 



B. Proof of Proposition 3 

We draw the information flow graph, as shown in Fig. |9] 
and prove the results by the following steps: 

• Step 1: connect the DC to the new node, and nodes 
n, ■ ■ ■ , n — k + 1, as shown in Fig. [9] Then the cut 
constraint is, 

2 (21 ') + (fc-l)a > M, « ( 2i') > M-(k-l)a. (31) 

• Step 2: connect the DC to the new node, and node 2 and 
(fc — 2) other nodes. We see for all (^2) selections of 



other nodes, each zi 



Rij) a PP ears (fc_a) 
both sides of the constraints, we have 



Thus if we add 



(Zr, 



-Z32) 



n-2 
fe-2 



(fe-l)a > 



Z (2l' 



> 



<n-2\ 
\k-2) 
fn-S\ 

\k-2J 



n-2 

fe-2 
(3'2) 



(M - (fe - l))a. 



(33) 



• Step 3: Any other cut constraint is non-active constraint, 
therefore, 



( n ~ 2 ) _ 9 

° c > (^+l)(M-(fe-l)a) = (— - 



-l)(M-(k-l)a). 

(34) 



C. Proof of Claim 1 

The corresponding cost matrix (C) for the repair on node 6 
is as below. We formulate the linear optimization problem as 
follows. There are = 10 active cut constraints. Fig. [TOl 

shows one of these cut constraints. 



C 



(0 


1 


00 


1 


DC 


oo\ 


00 





1 


OO 


1 


OO 


00 


00 





OO 


OO 


1 


00 


00 


OO 





1 


00 


\oo 


00 


00 


OO 





1/ 



(35) 
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Fig. 9. Cut analysis in star network, node 2 is central node. This figures 
corresponds to the cut constraint of 2 ( 2 i') + ~ 1) Q :2 M. 




Fig. 10. Cut analysis in the 2x3 grid network with n = 6, k = 4. The cut 
in this figure corresponds to the inequality, 2(56) ^ M — 3a 
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(36) 



and z 



Z\i 
Z-2?, 
Z25 
Z36 
z 45 
\Z5SJ 



where .1 is a 10 x 1 vector with all the entities equal to one). 
We note that matrix H is resulted from cut-set analysis in 
the first stage of repair. For example the cut mentioned in 
Fig. [10] constructs the 8-th row in matrix H, corresponding to 



inequality 



''(56) 



> M — 3a. 



Solving this linear optimization problem (e.g. by the sim- 
plex method) for the M = 8 and a — 2 results: 

cr c = 7; 



( z (12), 2(14), 2(23), 2(25), 2( 36 ), z (45) , 2 (56)) 

(0,1,0,1,1,2,2).B 



D. Proof of Proposition 4 

The cost matrix corresponding to regenerate the new node 
on node 5 equals to, 
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(37) 



The cut analysis in first stage of repair give the following 
linear programming problem, 



min cr c (z ) = l T z 
s.t. H.z> (M - 2a)l, 
Z(ij) > 0, 



(38) 
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where 1 is a column vector with 10 elements all equal 
to 1, and T indicates vector transpose, and vector z = 

( z (12)i z (13)> z (14)i Z(15), 2(23) i z (24)> z (25)y z (34), z (35)y z (45))> 

and H_ calculated from cut set analysis equals to: 



H = 



Solving the linear programming problem for M = 6, a = 2 
gives the minimum repair cost as, Z( 15 ) = Z( 2 5) = Z( 35 ) — 
Z(45) = 1 and zero for other links. This shows in this example 
the minimizing repair cost approach gives the same minimum 
value as minimizing repair bandwidth. 



E. Proof of Proposition 5 

The cut-analysis is same as Proposition 4. The only differ- 
ence is in the cost function. Thus, the optimization problem 
would be, 



min cr c (z) = (1, 1, 1, 3, 1, 1, 3, 1, 3, 3)z 
s.t. H .z>(M - 2a)l, 
z (ij) > 0, 



(39) 



where H_ , M, a are same as Proposition 4. Solving the linear 
programming problem gives the optimum solution as Z( 2 3) = 
z {34) = z {45) = 2 and a c = (1,1,1,3,1,1,3,1,3,3)0 = 

z (23) X 1 + 2(34) X 1 + Z( 45 ) X 3 = 10. 
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